Semantically Linking and Browsing Provenance Logs for E-science
Abstract
e-Science experiments are those performed using computer-based resources such as database searches, simulations or other applications. As with their laboratory-based counterparts, the data associated with an e-Science experiment are of reduced value if other scientists cannot identify the origin, or provenance, of those data. Provenance is the term given to metadata about experiment processes, the derivation paths of data, and the sources and quality of experimental components, including the scientists themselves, related literature, etc. Consequently, provenance metadata are a valuable resource for e-Scientists who need to repeat experiments, track versions of data and experiment runs, and verify experiment results; they are also a source of experimental insight. One specific kind of in silico experiment is a workflow. In this paper we describe how we can assemble a Semantic Web of workflow provenance logs that allows a bioinformatician to browse and navigate between experimental components by generating hyperlinks based on the semantic annotations associated with them. By associating well-formalized semantics with workflow logs, we take a step towards the integration of process provenance information and improved knowledge discovery.
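The idea of generating links from shared semantic annotations can be illustrated with a minimal sketch. This is not the authors' implementation: the record identifiers, concept names, and the `build_links` helper are all hypothetical, and the sketch assumes each experimental component is simply tagged with a set of ontology concept names. Two components are linked whenever they share a concept.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical provenance records (an assumption for illustration):
# each experimental component carries a set of semantic annotations,
# here plain concept names standing in for ontology terms.
records = {
    "run42/input_sequence": {"ProteinSequence"},
    "run42/blast_report": {"BlastReport", "ProteinSequence"},
    "run57/input_sequence": {"ProteinSequence"},
    "run57/alignment": {"MultipleAlignment"},
}


def build_links(records):
    """Map each component to a sorted list of (other_component, shared_concept)."""
    # Index components by the concepts they are annotated with.
    by_concept = defaultdict(set)
    for component, concepts in records.items():
        for concept in concepts:
            by_concept[concept].add(component)

    # Every pair of components sharing a concept gets a link in both directions.
    links = defaultdict(set)
    for concept, components in by_concept.items():
        for a, b in combinations(sorted(components), 2):
            links[a].add((b, concept))
            links[b].add((a, concept))
    return {component: sorted(targets) for component, targets in links.items()}


links = build_links(records)
for component, targets in sorted(links.items()):
    for other, concept in targets:
        print(f"{component} -> {other} (shared annotation: {concept})")
```

In a real system the annotations would more plausibly be RDF statements against a shared ontology, and links could also follow subclass or other semantic relationships rather than exact concept matches.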
Similar resources
Annotating, linking and browsing provenance logs for e-Science
Like experiments performed at a laboratory bench, the results of an e-Science in silico experiment are of limited value if other scientists are not able to identify the origin, or provenance, of those results. For e-Science, we need more systematic provenance logs across a range of e-Science activities and disciplines as well as a more informed understanding of the information in these provenanc...
Semantically linking web pages to web services in Bioinformatics
A key application area of semantic technologies is the fast-developing field of bioinformatics. Sealife is a project within this field with the aim of creating semantics-based web browsing capabilities for the life sciences. This includes meaningfully linking significant terms from the text of a web page to executable web services. This requires the semantic mark-up of biological terms, linking ...
Enabling Semantic Analysis of User Browsing Patterns in the Web of Data
A useful step towards better interpretation and analysis of usage patterns is to formalize the semantics of the resources that users are accessing on the Web. We focus on this problem and present an approach for the semantic formalization of usage logs, which lays the basis for effective techniques of querying expressive usage patterns. We also present a query answering approach, which is u...
An Identity Crisis in the Life Sciences
myGrid is an e-Science project assisting life scientists to build workflows that gather and co-ordinate data from distributed, autonomous, replicated and heterogeneous resources. The provenance logs of workflow executions are recorded as RDF graphs. The log of one workflow run is used to trace the history of its execution process; however, by aggregating provenance logs of workflow reruns, or run...
Provenance Management in Practice
Scientific Workflow Management Systems (SWfMSs), such as our own research prototype e-BioFlow, are being used by bioinformaticians to design and run data-intensive experiments, connecting local and remote (Web) services and tools. Preserving data for later inspection or reuse determines the quality of results. Validating results is essential for scientific experiments. This can all be achiev...